Get the best from substructure mining

نویسنده

  • Jeroen Kazius
چکیده

The chemical information that is present in a set of compounds is rarely fully exploited. This is mostly because no descriptor set can capture all biologically important features. As a result, valuable chemical knowledge can thus stay hidden from hypothesis-based drug design. The simplest form of a structure-activity relationship (SAR) is a substructure that predisposes compounds towards reduced or increased biological activity. Such simple patterns should not be missed during drug design. The aim of substructure mining is to present those substructures that are most likely related to biological activity. This method thus provides rapid access to a substantial repertoire of chemical descriptors that otherwise remains hidden: substructures. In short, substructure mining consists of a focused, but exhaustive, series of substructure searches. This poster describes how AweSuM, the new Awesome Substructure Mining tool from Curios-IT, was employed to learn the most interesting substructures. The poster also discusses the value of enriching the data with 2D pharmacophore information prior to mining. An enriched, detailed SAR analysis produced a scaffold that summarises the chemical content of datasets better than any standard substructure. The pharmacophore that AweSuM extracted shows predictive power and agrees with published chemical knowledge. These results demonstrate that useful SAR knowledge can be extracted from the vast space of substructure descriptors. In this way, AweSuM reveals key substructures (e.g., pharmacophores or toxicophores), which can often be predictive for biological activities.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining Interesting Aspects of a Product using Aspect-based Opinion Mining from Product Reviews (RESEARCH NOTE)

As the internet and its applications are growing, E-commerce has become one of its rapid applications. Customers of E-commerce were provided with the opportunity to express their opinion about the product on the web as a text in the form of reviews. In the previous studies, mere founding sentiment from reviews was not helpful to get the exact opinion of the review. In this paper, we have used A...

متن کامل

Substructure Mining Using Elaborate Chemical Representation

Substructure mining algorithms are important drug discovery tools since they can find substructures that affect physicochemical and biological properties. Current methods, however, only consider a part of all chemical information that is present within a data set of compounds. Therefore, the overall aim of our study was to enable more exhaustive data mining by designing methods that detect all ...

متن کامل

A Graph-based Interaction Pattern Discovery for Human Meetings

Mining Human Interaction flow in meetings or general representation of any interaction face to face to meetings is useful to identify the person reaction in dissimilar situation. Activities represent the natural history of the individual and mining methods help to analyze how person delivers their opinion in different ways. Meeting interactions are categorized as propose, comment, acknowledgeme...

متن کامل

Structure Discovery in Sequentially Connected Data

Much of current data mining research is focused on discovering sets of attributes that discriminate data entities into classes, such as shopping trends for a particular demographic group. In contrast, we are working to develop data mining techniques to discover patterns consisting of complex relationships between entities. Our research is particularly applicable to domains in which the data is ...

متن کامل

A New Approach to Protein Structure Mining and Alignment

One of the largest areas of bioinformatic and data mining research has been in the protein domain. These efforts have included protein structure prediction, folding pathway prediction, sequence alignment, ab initio simulation, structure alignment, substructure detection and many others. Substructure detection is generally defined as the mining of a molecule’s 3D structure in order to find inter...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 2  شماره 

صفحات  -

تاریخ انتشار 2010